智能论文笔记

SafeDrones: Real-Time Reliability Evaluation of UAVs using Executable Digital Dependable Identities

Koorosh Aslansefat , Panagiota Nikolaou , Martin Walker , Mohammed Naveed Akram , Ioannis Sorokos , Jan Reich , Panayiotis Kolios , Maria K. Michael , Theocharis Theocharides , Georgios Ellinas

分类：机器人

2022-07-12

无人驾驶汽车（UAV）的使用提供了各种应用程序的许多优势。但是，安全保证是广泛使用的关键障碍，尤其是考虑到无人机所经历的不可预测的操作和环境因素，这些因素很难仅在设计时间内捕获。本文提出了一种称为SAFEDRONES的新可靠性建模方法，以通过实现无人机的运行时可靠性和风险评估来帮助解决此问题。它是可执行数字可靠身份（EDDI）概念的原型实例化，该概念旨在为多机器人系统的实时，数据驱动的可靠性保证创建基于模型的解决方案。通过提供实时可靠性估算，SAFEDRONES允许无人机以自适应方式相应地更新其任务。

translated by 谷歌翻译

Keep your Distance: Determining Sampling and Distance Thresholds in Machine Learning Monitoring

Al-Harith Farhad , Ioannis Sorokos , Andreas Schmidt , Mohammed Naveed Akram , Koorosh Aslansefat , Daniel Schneider

分类：机器学习 | 人工智能

2022-07-11

机器学习〜（ML）近年来在不同的应用和域上提供了令人鼓舞的结果。但是，在许多情况下，需要确保可靠性甚至安全性等质量。为此，一个重要方面是确定是否在适合其应用程序范围的情况下部署了ML组件。对于其环境开放且可变的组件，例如在自动驾驶汽车中发现的组件，因此，重要的是要监视其操作情况，以确定其与ML组件训练有素的范围的距离。如果认为该距离太大，则应用程序可以选择考虑ML组件结果不可靠并切换到替代方案，例如改用人类操作员输入。 SAFEML是一种基于培训和操作数据集的统计测试的距离测量，用于执行此类监视的模型无形方法。正确设置Safeml的限制包括缺乏用于确定给定应用程序的系统方法，需要多少个操作样本来产生可靠的距离信息以及确定适当的距离阈值。在这项工作中，我们通过提供实用方法来解决这些限制，并证明其在众所周知的交通标志识别问题中的用途，并在一个使用Carla开源汽车模拟器的示例中解决了这些局限性。

translated by 谷歌翻译

MAiVAR: Multimodal Audio-Image and Video Action Recognizer

Muhammad Bilal Shaikh , Douglas Chai , Syed Mohammed Shamsul Islam , Naveed Akhtar

分类：计算机视觉

2022-09-11

当前，根据CNN处理的视频数据，主要执行动作识别。我们研究CNN的表示过程是否也可以通过将基于图像的动作音频表示为任务中的多模式动作识别。为此，我们提出了多模式的音频图像和视频动作识别器（MAIVAR），这是一个基于CNN的音频图像到视频融合模型，以视频和音频方式来实现卓越的动作识别性能。Maivar提取音频的有意义的图像表示，并将其与视频表示形式融合在一起，以获得更好的性能，与大规模动作识别数据集中的两种模式相比。

translated by 谷歌翻译

Thermal Heating in ReRAM Crossbar Arrays: Challenges and Solutions

Kamilya Smagulova , Mohammed E. Fouda , Ahmed Eltawil

分类：机器学习

2022-12-28

Increasing popularity of deep-learning-powered applications raises the issue of vulnerability of neural networks to adversarial attacks. In other words, hardly perceptible changes in input data lead to the output error in neural network hindering their utilization in applications that involve decisions with security risks. A number of previous works have already thoroughly evaluated the most commonly used configuration - Convolutional Neural Networks (CNNs) against different types of adversarial attacks. Moreover, recent works demonstrated transferability of the some adversarial examples across different neural network models. This paper studied robustness of the new emerging models such as SpinalNet-based neural networks and Compact Convolutional Transformers (CCT) on image classification problem of CIFAR-10 dataset. Each architecture was tested against four White-box attacks and three Black-box attacks. Unlike VGG and SpinalNet models, attention-based CCT configuration demonstrated large span between strong robustness and vulnerability to adversarial examples. Eventually, the study of transferability between VGG, VGG-inspired SpinalNet and pretrained CCT 7/3x1 models was conducted. It was shown that despite high effectiveness of the attack on the certain individual model, this does not guarantee the transferability to other models.

translated by 谷歌翻译

PMODE: Prototypical Mask based Object Dimension Estimation

Thariq Khalid , Mohammed Yahya Hakami , Riad Souissi

分类：计算机视觉

2022-12-26

Can a neural network estimate an object's dimension in the wild? In this paper, we propose a method and deep learning architecture to estimate the dimensions of a quadrilateral object of interest in videos using a monocular camera. The proposed technique does not use camera calibration or handcrafted geometric features; however, features are learned with the help of coefficients of a segmentation neural network during the training process. A real-time instance segmentation-based Deep Neural Network with a ResNet50 backbone is employed, giving the object's prototype mask and thus provides a region of interest to regress its dimensions. The instance segmentation network is trained to look at only the nearest object of interest. The regression is performed using an MLP head which looks only at the mask coefficients of the bounding box detector head and the prototype segmentation mask. We trained the system with three different random cameras achieving 22% MAPE for the test dataset for the dimension estimation

translated by 谷歌翻译

Beyond 5G Networks: Integration of Communication, Computing, Caching, and Control

Musbahu Mohammed Adam , Liqiang Zhao , Kezhi Wang , Zhu Han

分类：机器学习

2022-12-26

In recent years, the exponential proliferation of smart devices with their intelligent applications poses severe challenges on conventional cellular networks. Such challenges can be potentially overcome by integrating communication, computing, caching, and control (i4C) technologies. In this survey, we first give a snapshot of different aspects of the i4C, comprising background, motivation, leading technological enablers, potential applications, and use cases. Next, we describe different models of communication, computing, caching, and control (4C) to lay the foundation of the integration approach. We review current state-of-the-art research efforts related to the i4C, focusing on recent trends of both conventional and artificial intelligence (AI)-based integration approaches. We also highlight the need for intelligence in resources integration. Then, we discuss integration of sensing and communication (ISAC) and classify the integration approaches into various classes. Finally, we propose open challenges and present future research directions for beyond 5G networks, such as 6G.

translated by 谷歌翻译

COLT: Cyclic Overlapping Lottery Tickets for Faster Pruning of Convolutional Neural Networks

Md. Ismail Hossain , Mohammed Rakib , M. M. Lutfe Elahi , Nabeel Mohammed , Shafin Rahman

分类：计算机视觉

2022-12-24

Pruning refers to the elimination of trivial weights from neural networks. The sub-networks within an overparameterized model produced after pruning are often called Lottery tickets. This research aims to generate winning lottery tickets from a set of lottery tickets that can achieve similar accuracy to the original unpruned network. We introduce a novel winning ticket called Cyclic Overlapping Lottery Ticket (COLT) by data splitting and cyclic retraining of the pruned network from scratch. We apply a cyclic pruning algorithm that keeps only the overlapping weights of different pruned models trained on different data segments. Our results demonstrate that COLT can achieve similar accuracies (obtained by the unpruned model) while maintaining high sparsities. We show that the accuracy of COLT is on par with the winning tickets of Lottery Ticket Hypothesis (LTH) and, at times, is better. Moreover, COLTs can be generated using fewer iterations than tickets generated by the popular Iterative Magnitude Pruning (IMP) method. In addition, we also notice COLTs generated on large datasets can be transferred to small ones without compromising performance, demonstrating its generalizing capability. We conduct all our experiments on Cifar-10, Cifar-100 & TinyImageNet datasets and report superior performance than the state-of-the-art methods.

translated by 谷歌翻译

LMFLOSS: A Hybrid Loss For Imbalanced Medical Image Classification

Abu Adnan Sadi , Labib Chowdhury , Nursrat Jahan , Mohammad Newaz Sharif Rafi , Radeya Chowdhury , Faisal Ahamed Khan , Nabeel Mohammed

分类：计算机视觉 | 人工智能

2022-12-24

Automatic medical image classification is a very important field where the use of AI has the potential to have a real social impact. However, there are still many challenges that act as obstacles to making practically effective solutions. One of those is the fact that most of the medical imaging datasets have a class imbalance problem. This leads to the fact that existing AI techniques, particularly neural network-based deep-learning methodologies, often perform poorly in such scenarios. Thus this makes this area an interesting and active research focus for researchers. In this study, we propose a novel loss function to train neural network models to mitigate this critical issue in this important field. Through rigorous experiments on three independently collected datasets of three different medical imaging domains, we empirically show that our proposed loss function consistently performs well with an improvement between 2%-10% macro f1 when compared to the baseline models. We hope that our work will precipitate new research toward a more generalized approach to medical image classification.

translated by 谷歌翻译

An Adaptive Simulated Annealing-Based Machine Learning Approach for Developing an E-Triage Tool for Hospital Emergency Operations

Abdulaziz Ahmed , Mohammed Al-Maamari , Mohammad Firouz , Dursun Delen

分类：人工智能

2022-12-22

Patient triage at emergency departments (EDs) is necessary to prioritize care for patients with critical and time-sensitive conditions. Different tools are used for patient triage and one of the most common ones is the emergency severity index (ESI), which has a scale of five levels, where level 1 is the most urgent and level 5 is the least urgent. This paper proposes a framework for utilizing machine learning to develop an e-triage tool that can be used at EDs. A large retrospective dataset of ED patient visits is obtained from the electronic health record of a healthcare provider in the Midwest of the US for three years. However, the main challenge of using machine learning algorithms is that most of them have many parameters and without optimizing these parameters, developing a high-performance model is not possible. This paper proposes an approach to optimize the hyperparameters of machine learning. The metaheuristic optimization algorithms simulated annealing (SA) and adaptive simulated annealing (ASA) are proposed to optimize the parameters of extreme gradient boosting (XGB) and categorical boosting (CaB). The newly proposed algorithms are SA-XGB, ASA-XGB, SA-CaB, ASA-CaB. Grid search (GS), which is a traditional approach used for machine learning fine-tunning is also used to fine-tune the parameters of XGB and CaB, which are named GS-XGB and GS-CaB. The six algorithms are trained and tested using eight data groups obtained from the feature selection phase. The results show ASA-CaB outperformed all the proposed algorithms with accuracy, precision, recall, and f1 of 83.3%, 83.2%, 83.3%, 83.2%, respectively.

translated by 谷歌翻译

The Internet of Senses: Building on Semantic Communications and Edge Intelligence

Roghayeh Joda , Medhat Elsayed , Hatem Abou-zeid , Ramy Atawia , Akram Bin Sediq , Gary Boudreau , Melike Erol-Kantarci , Lajos Hanzo

分类：人工智能

2022-12-21

The Internet of Senses (IoS) holds the promise of flawless telepresence-style communication for all human `receptors' and therefore blurs the difference of virtual and real environments. We commence by highlighting the compelling use cases empowered by the IoS and also the key network requirements. We then elaborate on how the emerging semantic communications and Artificial Intelligence (AI)/Machine Learning (ML) paradigms along with 6G technologies may satisfy the requirements of IoS use cases. On one hand, semantic communications can be applied for extracting meaningful and significant information and hence efficiently exploit the resources and for harnessing a priori information at the receiver to satisfy IoS requirements. On the other hand, AI/ML facilitates frugal network resource management by making use of the enormous amount of data generated in IoS edge nodes and devices, as well as by optimizing the IoS performance via intelligent agents. However, the intelligent agents deployed at the edge are not completely aware of each others' decisions and the environments of each other, hence they operate in a partially rather than fully observable environment. Therefore, we present a case study of Partially Observable Markov Decision Processes (POMDP) for improving the User Equipment (UE) throughput and energy consumption, as they are imperative for IoS use cases, using Reinforcement Learning for astutely activating and deactivating the component carriers in carrier aggregation. Finally, we outline the challenges and open issues of IoS implementations and employing semantic communications, edge intelligence as well as learning under partial observability in the IoS context.

translated by 谷歌翻译